655 research outputs found

    Rethinking the Evaluation of Unbiased Scene Graph Generation

    Full text link
    Since the severe imbalanced predicate distributions in common subject-object relations, current Scene Graph Generation (SGG) methods tend to predict frequent predicate categories and fail to recognize rare ones. To improve the robustness of SGG models on different predicate categories, recent research has focused on unbiased SGG and adopted mean Recall@K (mR@K) as the main evaluation metric. However, we discovered two overlooked issues about this de facto standard metric mR@K, which makes current unbiased SGG evaluation vulnerable and unfair: 1) mR@K neglects the correlations among predicates and unintentionally breaks category independence when ranking all the triplet predictions together regardless of the predicate categories, leading to the performance of some predicates being underestimated. 2) mR@K neglects the compositional diversity of different predicates and assigns excessively high weights to some oversimple category samples with limited composable relation triplet types. It totally conflicts with the goal of SGG task which encourages models to detect more types of visual relationship triplets. In addition, we investigate the under-explored correlation between objects and predicates, which can serve as a simple but strong baseline for unbiased SGG. In this paper, we refine mR@K and propose two complementary evaluation metrics for unbiased SGG: Independent Mean Recall (IMR) and weighted IMR (wIMR). These two metrics are designed by considering the category independence and diversity of composable relation triplets, respectively. We compare the proposed metrics with the de facto standard metrics through extensive experiments and discuss the solutions to evaluate unbiased SGG in a more trustworthy way

    Uncertainty Quantification of Collaborative Detection for Self-Driving

    Full text link
    Sharing information between connected and autonomous vehicles (CAVs) fundamentally improves the performance of collaborative object detection for self-driving. However, CAVs still have uncertainties on object detection due to practical challenges, which will affect the later modules in self-driving such as planning and control. Hence, uncertainty quantification is crucial for safety-critical systems such as CAVs. Our work is the first to estimate the uncertainty of collaborative object detection. We propose a novel uncertainty quantification method, called Double-M Quantification, which tailors a moving block bootstrap (MBB) algorithm with direct modeling of the multivariant Gaussian distribution of each corner of the bounding box. Our method captures both the epistemic uncertainty and aleatoric uncertainty with one inference pass based on the offline Double-M training process. And it can be used with different collaborative object detectors. Through experiments on the comprehensive collaborative perception dataset, we show that our Double-M method achieves more than 4X improvement on uncertainty score and more than 3% accuracy improvement, compared with the state-of-the-art uncertainty quantification methods. Our code is public on https://coperception.github.io/double-m-quantification.Comment: 6 pages, 3 figure

    Signal processing and generation of bioactive nitric oxide in a model prototissue

    Get PDF
    The design and construction of synthetic prototissues from integrated assemblies of artificial protocells is an important challenge for synthetic biology and bioengineering. Here we spatially segregate chemically communicating populations of enzyme-decorated phospholipid-enveloped polymer/DNA coacervate protocells in hydrogel modules to construct a tubular prototissue-like vessel capable of modulating the output of bioactive nitric oxide (NO). By decorating the protocells with glucose oxidase, horseradish peroxidase or catalase and arranging different modules concentrically, a glucose/hydroxyurea dual input leads to logic-gate signal processing under reaction-diffusion conditions, which results in a distinct NO output in the internal lumen of the model prototissue. The NO output is exploited to inhibit platelet activation and blood clot formation in samples of plasma and whole blood located in the internal channel of the device, thereby demonstrating proof-of-concept use of the prototissue-like vessel for anticoagulation applications. Our results highlight opportunities for the development of spatially organized synthetic prototissue modules from assemblages of artificial protocells and provide a step towards the organization of biochemical processes in integrated micro-compartmentalized media, micro-reactor technology and soft functional materials

    Prediction of phosphotyrosine signaling networks using a scoring matrix-assisted ligand identification approach

    Get PDF
    Systematic identification of binding partners for modular domains such as Src homology 2 (SH2) is important for understanding the biological function of the corresponding SH2 proteins. We have developed a worldwide web-accessible computer program dubbed SMALI for scoring matrix-assisted ligand identification for SH2 domains and other signaling modules. The current version of SMALI harbors 76 unique scoring matrices for SH2 domains derived from screening oriented peptide array libraries. These scoring matrices are used to search a protein database for short peptides preferred by an SH2 domain. An experimentally determined cut-off value is used to normalize an SMALI score, therefore allowing for direct comparison in peptide-binding potential for different SH2 domains. SMALI employs distinct scoring matrices from Scansite, a popular motif-scanning program. Moreover, SMALI contains built-in filters for phosphoproteins, Gene Ontology (GO) correlation and colocalization of subject and query proteins. Compared to Scansite, SMALI exhibited improved accuracy in identifying binding peptides for SH2 domains. Applying SMALI to a group of SH2 domains identified hundreds of interactions that overlap significantly with known networks mediated by the corresponding SH2 proteins, suggesting SMALI is a useful tool for facile identification of signaling networks mediated by modular domains that recognize short linear peptide motifs

    Failure Mechanism of Bolts and Countermeasures in Swelling Soft Rock Support

    Get PDF
    The effect of conventional bolt support is not ideal due to the large deformation character of soft rock. As an innovative bolt, constant resistance large deformation (CRLD) bolt has been successfully applied to swelling soft rock engineering, but the reinforcement mechanism is not yet clear. To investigate the interaction mechanism between bolt and surrounding rock, Nanshan Coal Mine of China was selected as the engineering background. The plastic zone of surrounding rock mass and the axial force of three bolts were obtained by theoretical analysis and FLAC3D numerical simulation. Failure processes of conventional pretension bolts in soft rock were reproduced, and the interaction between CRLD bolt and soft rock was investigated in comparison. The results indicate that: (1) The fracture zone of surrounding rock exceeds the anchorage range of the low pretension bolt, the bolt slides with surrounding rock and finally fails. (2) The fracture zone of surrounding rock does not exceed the anchorage range of the high pretension bolt. However, with the accumulation of deformation energy, stress concentration makes the bolt break. (3) CRLD bolt can effectively absorb the deformation energy released by soft rock and maintain constant support resistance. The conclusions obtained in this study provide significant references in the selection of bolts in soft rock engineering

    MMBench: Is Your Multi-modal Model an All-around Player?

    Full text link
    Large vision-language models have recently achieved remarkable progress, exhibiting great perception and reasoning abilities concerning visual information. However, how to effectively evaluate these large vision-language models remains a major obstacle, hindering future model development. Traditional benchmarks like VQAv2 or COCO Caption provide quantitative performance measurements but suffer from a lack of fine-grained ability assessment and non-robust evaluation metrics. Recent subjective benchmarks, such as OwlEval, offer comprehensive evaluations of a model's abilities by incorporating human labor, but they are not scalable and display significant bias. In response to these challenges, we propose MMBench, a novel multi-modality benchmark. MMBench methodically develops a comprehensive evaluation pipeline, primarily comprised of two elements. The first element is a meticulously curated dataset that surpasses existing similar benchmarks in terms of the number and variety of evaluation questions and abilities. The second element introduces a novel CircularEval strategy and incorporates the use of ChatGPT. This implementation is designed to convert free-form predictions into pre-defined choices, thereby facilitating a more robust evaluation of the model's predictions. MMBench is a systematically-designed objective benchmark for robustly evaluating the various abilities of vision-language models. We hope MMBench will assist the research community in better evaluating their models and encourage future advancements in this domain. Project page: https://opencompass.org.cn/mmbench
    • ā€¦
    corecore